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Abstract 

For bin packing, the input consists of n items with sizes si,..., e [0, 1] which have to 
be assigned to a minimum number of bins of size 1. Recently, the second author gave an 
LP-based polynomial time algorithm that employed techniques from discrepancy theory 
to find a solution using at most OPT + 0{\ogOPT ■loglogOf’T’) bins. 

In this paper, we present an approximation algorithm that has an additive gap of 
only 0(log OP T) bins, which matches certain combinatorial lower bounds. Any further 
improvement would have to use more algebraic structure. Our improvement is based 
on a combination of discrepancy theory techniques and a novel 2-stage packing: first we 
pack items into containers; then we pack containers into bins of size 1. Apart from being 
more effective, we believe our algorithm is much cleaner than the one of Rothvoss. 


1 Introduction 

One of the classical combinatorial optimization problems that is studied in computer sci¬ 
ence is Bin Packing. It appeared as one of the prototypical NP-hard problems already in the 
book of Garey and Johnson |GJ79| but it was studied long before in operations research in 
the 1950’s, for example by IEis57l . We refer to the survey of Johnson |GGJ84| for a complete 
historic account. Bin packing is a good example to study the development of techniques 
in approximation algorithms as well. The 1970’s brought simple greedy heuristics such as 
First Fit, analyzed by Johnson |Joh73| which requires at most 1.7 • OPT 1 bins and First Fit 
Decreasing |JDU'''74| , which yields a solution with at most ^OPT -t4 bins (see ID6s07l for 
a tight bound of ^-OPT -t-1). Later, an asymptotic PTAS was developed by Fernandez de la 
Vega and Luecker iFdlVLSlI . One of their main technical contributions was an item group¬ 
ing technique to reduce the number of different item types. The algorithm of De la Vega and 
Luecker finds solutions using at most (1 -t- e)OPT -t- 0{^) bins, while the running time is ei¬ 
ther of the form if one uses dynamic programming or of the form 0{n ■ fic)) if one 

applies linear programming techniques. 

A big leap forward in approximating bin packing was done by Karmarkar and Karp in 
1982 IKK82I . First of all, they argue how a certain exponential size LP can be approximately 
solved in polynomial time; secondly they provide a sophisticated rounding scheme which 
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produces a solution with at most OPT + 0(log^ OPT] bins, corresponding to an asymptotic 
FPTAS. 

It will be convenient throughout this paper to allow a more compact form of input, where 
s e [0,1] ” denotes the vector of different item sizes and h e N” denotes the multiplicity vector, 
meaning that we have h, copies of item type i . In this notation we say that L ”= i is the total 
number of items. The linear program that we mentioned earlier is called the Gilmore-Gomory 
LP relaxation IEis57l[GG61l and it is of the form 

min{l^x I Ax > h,x > 0}. (1) 

Here, the constraint matruc A consists of all column vectors p e Z”q that satisfy p/5, < 1. 
The linear program has variables Xp that give the number of bins that should be packed 
according to the pattern p. 

We denote the value of the optimal fractional solution to {T) by OP Tf, and the value of the 
best integral solution by OPT. As we mentioned before, the linear program {TJ does have an 
exponential number of variables, but only n constraints. A fractional solution x of cost l^x < 
OPTf+5 can be computed in time polynomial in h, and 1/5 IKK82I usingthe Grotschel- 
Lovasz-Schrijver variant of the Ellipsoid method IGLS81I . An alternative and simpler way 
to solve the LP approximately is via the Plotkin-Shmoys-Tardos framework IPST95I or the 
multiplicative weight update method. See the survey of IAHK12I for an overview. 

The best known lower bound on the integrality gap of the Gilmore-Gomory LP is an in¬ 
stance where OPT = \OPTf'\ -i-1; Scheithauer and Terno IST97I conjecture that these in¬ 
stances represent the worst case additive gap. While this conjecture is still open, it is under¬ 
standable that the best approximation algorithms are based on rounding a solution to this 
amazingly strong Gilmore Gomory LP relaxation. Lor example, the Karmarkar-Karp algo¬ 
rithm operates in log n iterations in which one first groups the items such that only jT.i€in] 
many different item sizes remain; then one computes a basic solution x and buys [Xp\ times 
pattern p and continues with the residual instance. The analysis provides a Oflog^ OPT) 
upper bound on the additive integrality gap of l[D . 

The rounding mechanism in the recent paper of the second author IRotl3l uses an algo¬ 
rithm by Lovett and Meka that was originally designed for discrepancy minimization. The 
Lovett-Meka algorithm ILM12I can be conveniently summarized as follows: 

Theorem 1 (Lovett-Meka ’12). Let Vi,...,Vm be vectors with Xstart ^ [0> 1] ” (^nd param¬ 
eters Ai. Am >0 so thatY .^-1 Then in randomized polynomial time one can 

find a vector Xe„d e [0,1] ” so that \ {Xgnd - Xstan, Vj) \-^j' II Vj lb for all j e {1,..., m} and at 
least half of the entries ofxend ore in 0/1. 

Intuitively, the points Xend satisfying the linear constraints I <Xend“•^start) I ^ Ay • WvjWz 
form a polytope and the distance of the jth hyperplane to the start point is exactly Ay. 
Then the condition e~‘^J < ^ essentially says that the polytope is going to be “large 

enough”. The algorithm of ILM12I itself consists of a random walk through the polytope. Lor 
more details, we refer to the very readable paper of ILM12I . 
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The bin packing approximation algorithm of Rothvoss IRotl3l consists of logarithmically 
many runs of Lovett-Meka. To be able to use the Lovett-Meka algorithm effectively, Rothvoss 
needs to rebuild the instance in each iteration and “glue” clusters of small items together to 
larger items. His procedure is only able to do that for items that have size at most poiyiog(n) 
and each of the iterations incurs a loss in the objective function of 0(loglog n). In contrast 
we present a procedure that can even cluster items together that have size up to H(l). More¬ 
over, Rothvoss’ algorithm only uses two types of parameters for the error parameters, namely 
Aj e {0, Oj-y/loglogn)}. In contrast, we use the full spectrum of parameters to achieve only a 
constant loss in each of the logarithmically many iterations. 

1.1 Our contribution 

Our main contribution is the following theorem: 

Theorem 2. For any Bin Packing instance (s, b) with si.e [0,1], one can compute a 

solution with at most OPTf + 0(\ogOPTf) bins, where OP Tf denotes the optimal value of the 
Gilmore-Gomory LP relaxation. The algorithm is randomized and the expected running time 
is polynomial inYfl^.^ bi. 

The recent book of Williamson and Shmoys IWSllI presents a list of 10 open problems 
in approximation algorithms. Problem #3 in the list is whether the Gilmore-Gomory LP has 
a constant integrality gap; hence we make progress towards that question. 

We want to remark that the original algorithm of Karmarkar and Karp has an additive 
approximation ratio of 0(logOPr^Tog(maX;j {^})). For 3-pflrP'P'on instances where all item 

sizes are strictly between ^ and this results in an 0(log n) guarantee, which coincides with 
the guarantees of Rothvoss lRotl3l and this paper if applied to those instances. A paper of 
Eisenbrand et al. lEPRllI gives a reduction of those instances to minimizing the discrepancy 
of 3 permutations. Interestingly, shortly afterwards Newman and Nikolov INNN12I showed 
that there are instances of 3 permutations that do require a discrepancy of G (log n). It seems 
unclear how to realize those permutations with concrete sizes in a bin packing instance — 
however any further improvement for bin packing even in that special case with item sizes 
in ]|, would need to rule out such a realization as well. The second author is willing to 
conjecture that the integrality gap for the Gilmore Gomory LP is indeed ©(log n). 
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2 A 2-stage packing mechanism 


It is well-known that for the kind of approximation guarantee that we aim to achieve, one 
can assume that the items are not too tiny. In fact it suffices to prove an additive gap of 
Oflogmaxfn, 7 ^}) where n is the number of different item sizes and Smin is a lower hound 

^min 

on all item sizes. Note that in the following, “polynomial time’’ means always polynomial in 
the total number of items ^ bi. 

Lemma 3. Assume for a monotone function f, there is a polynomial timeOPTf+fimax{n, 
algorithm for Bin Packing instances is, b) with s e [0,1] ” and Si,..., > S min > 0. Then there 

is a polynomial time algorithm that for all instances finds a solution with at most OPTf + 
fiOPTf) + OilogOPTf) bins. 

For a proof, we refer to Appendix A. From now on we assume that we have n different 
item sizes with all sizes satisfying 5 , > Smin for some given parameter Smin (as a side remark, 
the reduction in Lemma[3]will choose Smin = Q( opr^ ))- Starting from a fractional solution x 

to {B our goal is to find an integral solution of cost Oflogmaxfn, Another useful 

standard argument is as follows; 

Lemma 4. Any bin packing instance (s, b) can be packed in polynomial time into at most 
Sjbi +1 bins. 

Proof. Simply assign the items greedily and open new bins only if necessary. If we end up 
with k bins, then at least fc - 1 of them are at least half full, which means that Sibi > 
y (fc - 1). Rearranging gives the claim. □ 

Now, we come to the main mechanism that allows us the improvement over Rothvoss lRotl3l . 
Consider an instance (s, b) and a fractional LP solution x. We could imagine the assignment 
of items in the input to slots in x as a fractional matching in a bipartite graph, where we have 
nodes i e [n] on the left hand side, each with demand bi and nodes [p, i) on the right hand 
side with supply Xp ■ pt. Instead, our idea is to employ a 2-stage packing: first we pack items 
into containers, then we pack containers into bins. Here, a container is a multiset of items. 
Before we give the formal definition, we want to explain our construction with a small ex¬ 
ample that is visualized in Figure[Il The example has n = 3 items of size s = (0.3,0.2,0.1) and 
multiplicity vector h = (2,1,7). Those items are assigned into containers Ci, C 2 , C 3 which also 
have multiplicities. In this case we have yci = TCa = 1 copies of the first two containers and 
yCs = 2 copies of the third container. Moreover, in our example we have 3 patterns pi, pz, ps 
each with fractional value Xp^ = Xp^ = Xp^ = For example, item 2 is packed into container 
Cl and that container is assigned with a fractional value of j each to pattern pz and pz,. The 
reader might have noticed that we do allow that some copies of item i^ are assigned to slots 
of a larger item iz. On the other hand, we have bz = 7 copies of item 3, but only 6 slots in 
containers that we could use. So there will be 1 unit that we won’t be able to pack. Similarly, 
we have ycs = 2 copies of container C 3 , but only | slots in the patterns. Later we will say that 
the deficiency of the 2 -stage packing is 1 • 53 - 1 - y • s(C 3 ) where siCz) is the size of container C 3 . 

Now, we want to give the formal definitions. We call any vector C e Z"q with SfCi < 1 
a container. Here Ci denotes the number of copies of item i that are in the container. The 
size of the container is denoted by 5(C) := CiSf. Let C:= [C e Z"q | 5 (C) < 1} be the set of 
all containers. As we will pack containers into bins, we want to define a pattern as a vector of 
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Figure 1: Example for assigning items to containers and containers to patterns. 


the form p e Z>q where pc denotes the number of times that the pattern contains container 
C. Of course the sum of the sizes of the containers should be at most 1, thus 

P:={peZ^ol E Pc-siC)<l\ 

CeC 


is set of all (valid) patterns. 

Now suppose we have an instance {s, b) and a fractional vector x e To keep track of 
which containers should be used in the intermediate packing step, we also need to maintain 
an integral vector y e Z^q. 

We say that a bipartite graph G = (V^ u is a packing graph if each v eV^uVr has 
an associated size s[v) e [0,1] and multiplicity multlf) e K>o, and the edge set is given by 
E = {{u, v) eV( yiVr\ s{u) < 5 ( 1 ;)}. An assignmentin a packing graph is a function a:E^ K>o 
so that for any v eV, we have Lee 5 (i;) ti[e) < multlf), where 5{v) denotes the set of edges 
incident to v. The deficiency of a packing graph is the total size of left nodes that fail to be 
packed in an optimal assignment. That is, 

def{G) := min | Y 5 ( 1 ;) • (mult{r’) - aldlf)))}. 

a assignment of G ye ^ 

The edge set of those graphs is extremely simple, so that one can directly obtain the defi¬ 
ciency as follows: 

Observation 1. Ear any packing graph, an optimal assignment a \ E ^ IR>o which attains 
defiG) can be obtained as follows: go through the nodes v e Vr in any order. Take the node 
ueVc of maximum size that has some capacities left and satisfies s{u) < s{v). Increase a{u, v) 
as much as possible. 
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In this paper we further restrict ourselves to left-integralpacking graphs — that is, for any 
veVf, mult(y) e Z>o. We construct two packing graphs: one responsible for the assignment 
of items to containers and one for assigning containers to bins. 

• Assigning items to containers: Given b e Z”Q,y e Z>q, we define a packing graph GiCfi, y) 
as follows. The left nodes of the graph are defined by = [n], with sizes Si and mul¬ 
tiplicities hi- The right nodes are defined by V,- = {{/, C) \ C e C,i e C] with the size of 
node (1,0 given by Si and multiplicity by yc ■ Ci. 

• Assigning containers to patterns: Given y e Z>q and x e IR>q, we define a packing graph 
G 2 (x, y). The left nodes are given by C with sizes given by the sizes of containers, and 
multiplicities yc- The right nodes are given by {{G, p) : C eC,p e V], with the size of 
node (G, p) given by 5(G) and the multiplicity by xc ■ Pc- 

We then define the deficiency of the pair (x, y) with item multiplicities b to be the sum 

defb(x,y) := def(Gi(fi,y))-tdef(G 2 (x,y)). 

In later sections we will often leave off the b to simplify notation. 

We should discuss why the 2-stage packing via the containers is useful. First of all, it is 
easy to find some initial configuration. 

Lemma 5. For any bin packing instance ( 5 , b), one can compute a “startingsolution” x e IR^q 
andye Z^q inpolynomial time so thatl^ x< OPTf-\- 1 anddej[x,y) = 0 with \supp{x)\ < n. 

Proof. As we already argued, one can compute a fractional solution x for O in polynomial 
time that has cost l^x < OPTf + 1. We simply use singleton containers {1} for all items i e 
{ 1 ,..., n} and set y{,□ 

Next, we argue that our notation of deficiency was actually meaningful in recovering an 
assignment of items to bins. 

Lemma 6. Suppose thatx £ Z^q, y £ Z>q are both integral. Then there is a packing of all items 
into at most 1 ^x - 1 - 2def[x, y) - 1 -1 bins. 

Proof Since x and y are both integral, all multiplicities in Gi and G 2 will be integral and we 
can find two integral assignments ai, 02 attaining def(x, y). Buy all the patterns suggested by 
X. Use a 2 to pack the containers in y. Then use ai to map the items to containers. There are 
some items that will not be assigned — their total size is def(Gi [b, y)). Moreover, there might 
also be containers in y that have not been assigned; their total size is def(G 2 (x, y)). We pack 
items and containers greedily into at most 2def(x, y) - 1 -1 many extra bins using LemmalU □ 

In each iteration of our algorithm, it will be useful for us to be able fix the integral part of 
X and focus solely on the fractional part. 

Lemma?. Suppose x e y e and be Z"^. Ifxp = [XpJ for all patterns p, then there ex¬ 
ist vectors y eZ^^, be Z"q withb< b so that deftix, y) = 0 and deft _i^{x-x, y- y) = deftix,y). 
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Proof. Let us imagine that we replace each node (C, p) in G 2 (x, y) with two copies, a “red” 
node and a “hlue” node. The red copy receives an integral multiplicity of multredlC-P) = 
XpPc while the hlue copy receives a fractional multiplicity of multbiuelCi p) = (^p ^ ^p) • Pc- 
Now we apply Observation [T] to find the best assignment a. Crucially, we set up the order 
of the right hand side nodes so that we first process the red integral nodes and then the 
blue fractional ones. Note that the assignment that this greedy procedure computes is op¬ 
timal and moreover, the assignments for red nodes will be integral. For each container C 
on the left, we define yc to be the total red multiplicity of its targets under this optimal 
assignment. Then def(G 2 {x,y)) = 0 and def(G 2 (x-x,y-y)) = def{G 2 (x,y)). In the graph 
Gi ib, y), all multiplicities are integral anyway, so we can trivially find an integral vector b so 
that def(Gi (h, y)) = 0 and def(Gi(h- h,y- y)) = def(Gi(fi,y)). □ 

Define supp(x) := {p eV : Xp > 0} as the support of x and frac(x) := {p e V : 0 < Xp < 1} 
as the patterns in p that are still fractional. Now we have enough notation to state our main 
technical theorem: 

Theorem 8. Let [s, b] be an instance with 5 i,..., > Smin > 0- Let y e Z>g and x e [0,1 

with \ supp{x) \ > Llog{-^), where L is a large enough constant. Then there is a randomized 

■Jmin 

polynomial time algorithm that finds y e Z>g and x e with l^x = l^x and defx,y) < 
defx, y) -H 0 ( 1 ) while \frac[x) \ < | \frac{x] \. 

While it will take the remainder of this paper to prove the theorem, the algorithm behind 
the statement can be split into the following two steps: 

(I) Rebuilding the container assignment: We will change the assignments for the pair (x, y) 
so that for every container in size class a the patterns in supp(x) use, they use nearly 
( 1 ) 1/2 cQpjgg^ while no individual pattern in supp(x) contains more than copies 
of the same container. 

{II) Application of Lovett-Meka: We will apply the Lovett-Meka algorithm to sparsify the 
fractional solution x. Here, the vectors vj that comprise the input for the LM-algorithm 
will correspond to sums over intervals of rows of the constraint matrix A. Recall that 
the error bound provided by Lovett-Meka crucially depends on the lengths || vj II 2 . The 
procedure in (/) will ensure that the Euclidean length of those vectors is small. 

Once we have proven Theorem[ 8 l the main result easily follows: 

Proof of Theorem^ We compute a fractional solution x to Q of cost 1 ^x < OPTf+l. In fact, 
we can assume that x is a basic solution to the LP and hence |supp(x)| < n. We construct a 
container assignment y consisting only of singletons, see Lemma [H Then for log(n) iter¬ 
ations, we first use Lemma [7] to split the current solution x as x = x“* - 1 - where x^p^ = 
[Xp\ and obtain a corresponding split y = -t- Then we run Theorem [ 8 ] with in¬ 

put (x^'^^‘^,y^'^^‘^) and denote the result by (x^'^^‘^,y^'^^‘^). Finally we update x := x“* - 1 - and 

y ;= yint + yfrac_ 

Assoonas |frac(x)| < 0(log j^), wecanjustbuyeverypatterninfrac(x). In each iteration 
the deficiency increases by at most 0(1). At the end, we use Lemma[ 6 ]to actually pack the 
items into bins. We arrive at a solution of cost OPTf + 0(logmax{n, 7 ^}) which is enough, 

L 'Jmin 

using LemmalSl □ 

We will describe the implementation of (7) in Section|3]and then (77) in SectionlH 
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3 Rebuilding the container assignment 


In this section we assume that we are given xe [0,1 with | supp (x) | = m. To ease notation, 
we will only write the nonzero parts of x, so that if supp{x) = {pi, p 2 ,Pm), then x = (Xp,.)^^. 

We update x hy altering the patterns that make up its support. Even though some patterns 
could become identical, we continue to treat them as separate patterns. 

Originally, we had defined A as the incidence matrix of the Gilmore Gomory LP in {TJ 
where the rows correspond to items. Due to our 2-stage packing, we actually consider the 
patterns to he multi-sets of containers, not items anymore. Hence, let us for the rest of the 
paper redefine the meaning of A. Now, the rows of A correspond to the containers in C 
ordered from largest to smallest, and columns represent the patterns in supp(x). As we per¬ 
form the grouping and container-forming operations, we update the columns of the matrix. 

The resulting columns then yield a new fractional solution x hy taking Xp^ copies of the pat¬ 
tern now in column i. 

We will now describe our grouping and container reassignment operations, keeping track 
of what happens to the fractional solution as well as to the corresponding matrix. 

First, we need a lemma that tells us how rebuilding the fractional solution affects the 
deficiency. To have some useful notation, define mult(C, x) := Lpep mult(C, p) = LpeP ^pVc 
to be the number of times that the patterns cover container CeC. 

Now, if Ls(c)>s yc ^ Lx(c)>s yc for all s > 0, then we write y<y. Moreover, if mult(C, x) > 

Lx(C)>s rnult(C,x) for all s > 0, then we write x > x. Observe that M y < y and x> x, then 
def(G 2 (x,y)) <def(G 2 (x,y)). 

Lemma 9. Now suppose that t(j>0 is such that 


^ mult{C, x) > 

5(C)>s 


'Ls{C)>smult[C, X) 
'LsiO^s^ultiC, X) 


^(7 


if s> a 
if s<a 


Then defx, y) < defx, y] + a-ter- 

Proof Let Go be the largest container of size at most a, and let x' be the vector representing 
to- copies of the pattern containing a single copy of Gq. Then x-i- x' > x, and so def{G 2 (x-i- 
x', y)) < def(G 2 (x, y)). But if y' is the vector representing ta copies of Go, then def{G 2 (x, y)) = 
def(G 2 (x + x' ,y + y')), since we can find an optimal assignment taking the containers in y' to 
those of x'. Since the total size of y' is almost a to-> we have def(G 2 (x,y)) < def(G 2 (x-i-x',y))-i- 
atfj < def(G 2 (x, y)) -i- crfcr. and therefore def(x, y) < def(x, y) + atij. □ 

If (7 is a power of 2, say a = 2“^ for ( e Z>o, then we say the size class of a is the set of items 
with sizes between f a and a. In this next lemma, we round containers in patterns down so 
that each container type in size class a is either not used at all or is used at least ^ times. 

Lemma 10 (Grouping). Let [s, b) be a bin packing instance with y e z£q and x e [0,1 [^. For 
any size class a and 5 >0, we can find x e [0,1 so that 

1 . l^x=l^x 


2. \supp{x)\ < \supp[x)\ 

3. For each container type C in size class a, either mult{C, x) = 0 or s{C) ■ mult{C, x)>5. In 
all other size classes, the multiplicities of containers in patterns do not change. 
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4. dej[x, y) < def[x, y) + 0(5). 

Proof. Assume containers are sorted by size, from largest to smallest. Define Ss to be tbe set 
of containers in size class a not satisfying condition (3) above. In other words, 

Ss '■= {C in size class cr | 0 < s{C) ■ mult{C, x) < 5}. 

For a subset H c Ss, define the weight of H to be wiH) := LceH'S(C') • mult(C,x). Note 
that the weight of a single container is at most 5. Hence we can partition Ss = Hi\jH20...0Hr 
so that: 

1. wiHk]e[25,35],\fk=l,...,r-l. 

2. wiHr)<3d. 

3. C e Hjc, C' G Hfc+i implies siC) > 5{C'). 

For each k= I, ..., r - 1 and container C g we replace containers of type C in all pat¬ 
terns p G frac{x) with the smallest container type appearing in H^. For all C g Hr, remove 
containers of type C from all patterns p e frac{x). Call the updated vector x. We see imme¬ 
diately that 1 ^ X = 1 ^ X and I supp (x) | < | supp (x) |. 

Moreover, since every container type C appearing in x now has an entire group using 
it, and the weight of each container didn’t change by more than a factor of 2, we have 5(C) • 
mult(C, x) > 6, and so condition (3) is satisfied. To complete the proof, it remains to show 
that def(G 2 (x,y)) < def(G 2 (x,y)) 4-0(5). 

Now, for any i, there is at most one group whose containers (partly) changed from be¬ 
ing larger than 5(C;) to smaller. The weight of this group is at most 35, andsoX/<i niult(Cy,x)- 
T.j<i mult(Cy, x) < ^. Since this holds for all i, we can therefore apply Lemma[9]to conclude 
thatdef(G 2 (x,y)) < def(G 2 (x,y))-i-0(5). □ 

We now remark what happens to the associated matrix A under this grouping operation. 
Write A, A as our original and updated matrices, and Ac, Ac as the rows for container C. For 
container types C in size class a, either Acx = 0 or 5(C) • Acx > 5. For all other size classes, 
Ac = Ac- In particular, notice that we have either || Ac II i = 0 or 5(C) • |l II i ^ 

Before we introduce the next main lemma — how to reassign containers — we prove a 
useful result about decomposing packing graphs in a nice way. For a visualization of the 
following lemma, see Figure|2] 

Lemma 11. Suppose G = (If u Vr,E) is a left-integral packing graph as in Section\^ and 
that for every v g Vy, we are given red and blue multiplicities so that mult[v) = multrediv) 4- 
multhiue(v). Suppose further that all nodes v eVy of size greater than a have multrediv) = 0. 
Then we can find left-integral packing graphs Gred vitid Gsiue with the same edges, nodes, 
and sizes ofG but with multiplicities satisfying multred + tnultsue = rnult. Moreover, we have 
defGred) ~ fi and defiGsiue) — defiG) 4- cr. 

Proof By allowing fractional red and blue multiplicities, we can find initial values for the red 
and blue multiplicities of left nodes so that def(Gred) = 0 and def(Gbiue) = def(G). To enforce 
integrality, we will update these multiplicities by swapping (fractional parts of) larger red 
nodes for smaller blue nodes. 

Suppose nodes on the left with positive red multiplicity are ordered by size, so that a > 
5(r'i) > 5 ( 1 ^ 2 ) ^ ••• ^ sivf). While the multiplicities are not all integral, let i be the index of 
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Figure 2: Visualization of the decomposition of the left hand side multiplicities from 
Lemma [TT] Here, V( = {vi,...,V 7 } and Vr = {ui,...,U 7 }. Assume that siUi) = s{Vi) and 
siz/i) > ... > 5 ( 1 ^ 7 ). Nodes are lahelled with their multiplicities. In {b) and (c) we also depict 
the assignments corresponding to the deficiencies. 


the largest Vi with multred(i^i) not integral. If i < £, decrease multred(2^;) to LmulhedCi^dJ. 
and increase multred(i'i+i) hy the same amount. If i = simply decrease multred(t’t') to 
Lmultred(2^^)J- Notice that the deficiency of the red graph has not increased, since we are 
either replacing nodes with smaller nodes or decreasing the multiplicity of the last node. 
Moreover, we notice that for any size s, the total red multiplicity of nodes at least size s has 
decreased hy at most 1. Therefore in the complementary blue graph, 

^ {mult(,j^g iv)- multbiue iv))<l. 

vEVe:s(v)>s 

The additional blue nodes we fail to pack will therefore all have size at most a and their total 
multiplicity will be at most 1 , so the deficiency of the blue graph increases by at most a. □ 

A key technical ingredient for our algorithm is to be able to replace sets of identical copies 
of a container in patterns of x by a bigger container that contains the union of the smaller 
containers. 

Lemma 12. Given a pair {x,y] with x £ IR>q and y e Z^g. Let k e N and 0 < a < I be two 
parameters. Let x e be the vector that emerges if for all containers C with jcr < s{C] < a 
and all patterns p we replace k ■ [^\ copies ofC by L^J copies of the container that is k ■ C. 
Then there isayeJ.^^ so that defx, y) < defx, y] + 0{ka). 

Proof Consider the graph G 2 (x, y) as in section [2J For every right node (C, p), we assign 
multj-ediC’, p) = k- L^J • Xp for C in size class a, and mulhedCC, p) = 0 for all other C. We set 


iUi) 
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super-container: 


patterns: 


Xpj — 0.4 


Xp2 =0.4 


Xp3 = 0.4 


Xp4 — 0.4 


Xp5 - 0.4 


^ reassignment 



Xpj = 0.4 Xp2 = 0.4 Xp3 = 0.4 Xp4 = 0.4 Xp^ = 0.4 

Figure 3: Visualization of the reassignment in LemmafT^for fc = 3. The upper packing graph 
is the red part of Gz [x, y) with the optimal assignment a, assuming that each container has 
multiplicity 1. The lower graph gives the red part of Gz [x, y) with the constructed assignment 
a that we give in the analysis. Darker colors indicate larger containers. 
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multbiueCC, p) = mult{C, p) - multred(C, p). By LemmafTTl we can find integral red and blue 
multiplicities of left nodes so that def(Gred) = 0 and def(Gbiue) ^ def(x, y) + a. The red and 
blue graphs can now be treated separately, and so we restrict our attention to the red graph 
since it represents precisely the containers that we want to reassign. 

For all nodes (G, p) on the right of the red graph, we combine the copies of G in pat¬ 
tern p into containers of type k ■ C. For clarity we refer to these larger containers as super¬ 
containers. Similarly, we look at the containers of the left nodes, ordered from largest to 
smallest and taken with multiplicity. In consecutive sets of cardinality k, we combine the 
containers into super-containers, except perhaps fewer than k of the smallest ones. Write G, 
to represent the ith largest super-container on the left. 

We claim that all super-containers except Gi can be packed into the right nodes. To see 
how to pack them, let a be an optimal assignment in the original red graph. For all i, a 
assigned the containers making up G/ to some combination of large-enough containers of 
total multiplicity k. All such containers became part of super-containers in the new graph, 
and the total multiplicity of their contribution to these super-containers is exactly 1. These 
super-containers are not necessarily all large enough to fit G, , but they are all large enough 
to fit G/+ 1 , and this is exactly where we send G/+i. With this assignment, at most one super¬ 
container and k containers were left unpacked, and so the deficiency of the updated red 
graph is at most 2ka. 

For all containers G, we let yc = mulled (O + multbiue(C')- We note that we only changed 
y by rearranging the containers, and in particular we did not change the item multiplicities. 
Therefore we know that def(Gi(y)) = def(Gi(y)). With this definition of y, we note that Gred + 
Gbiue is precisely G 2 (x, y). We therefore have def{G 2 (x, y)) < def{G 2 (x, y)) + 0[ka), and so the 
total increase in deficiency is at most 0{ka). □ 

We are now ready to give our second main lemma of this section. 

Lemma 13 (Reassigning containers). Suppose x £ IR>Q,y e and a < 2“^. Then we can 
combine containers in size class a in x and y into larger containers, yielding new solutions 
X, y satisfying the following conditions. 

1 . l^x = l^x. 

2. \supp{x)\ <\supp[x)\. 

3. For all patterns p e suppix) and containers C in size class o, pc ^ 

4. Multiplicities of small containers in patterns in suppix) are not affected. 

5. defx, y) < defx, y) -i- . 

Proof. We apply LemmafT^with parameter k = L(^)^^'^J and obtain a pair (x, y) so that def(x, y) < 
Oika) < and so condition (5) is satisfied. Since we have updated x by altering the 

patterns in its support, conditions (1) and (2) are also satisfied. In the process of LemmafT^ 
we decreased pc for G in size class a to at most k. Since a < 2“^, we know that fc > 2, and so 
the containers we created are in strictly larger size classes. Therefore conditions (3) and (4) 
are satisfied. □ 


12 


Let us say briefly what the container reassignment does to the associated matrix A. If Ac 
is any row of the updated matrix corresponding to a container in size class a, we know Ac is 
entrywise less than or equal to Ac and ll^clloo ^ In all rows corresponding to smaller 

size classes, Ac = Ac- 

Before we talk about applying Lovett-Meka, we want to summarize the results of our 
grouping and container reassignment. We summarize the procedure: 

(1) For size classes Smin ^ cr < 2“^^, starting with the smallest, do: 

(2) Group the containers in size class a with 5 = \/d. 

(3) Whenever we find more than copies of the same container in one pattern, 
we put them together in a larger container. 

(4) For a > 2“^^, group the containers in size class a with 6 = 64. 

In the following we will call a size class a small if u < 2“^^ and large otherwise. First note that 
the increase in deficiency of the entire procedure is at most 

X (0(cr^^2)-t0(f7^''^))-H72-64 = 0(l). 

o-eZ-'^ 

Let A be the matrix we obtain at the end of this procedure. In addition, we would like to 
keep much of the group structure that was created during the procedure. Define the shadow 
incidence matrix A to be the matrix that agrees with A on large size classes, but for small 
size classes represents the incidences after step (2), but before step (3). We can imagine that 
whenever a container is put into a larger container, its incidence entry remains in A. In 
particular a container might be put into containers iteratively and hence it may contribute 
to several incidences in A but only one in A. Note that A is entrywise at least as large as A. 

For all containers C e C, let Ac denote the row of A corresponding to C, and Ac the 
corresponding row of A. Recall that A and A contain columns for patterns in frac(x). Now, 
let us summarize the properties that the container-forming procedure provides: 

(A) For a container C in size class cr one has IIAclli > if tr is small, and ||Aclli = 

II Ac II 1 > 64 if (7 is large. 

(B) For a container C in a small size class a, and column j = 1,..., m, one has Acj < 

(C) One has 

E II Aq II1 • < 24 X II Aq II1 • 5(Q). 

!=1 1 = 1 

Here (A) follows from the fact that after step (2), we have incidences for each container. 

(B) follows since after step (3), there are at most containers of each type in a pattern. 
The condition in (C) can be understood as follows: if we have a container of size s(C), then 
the containers in it may appear many times in A but only in smaller size classes. By discount¬ 
ing smaller incidences, we can upper-bound the contribution of the shadow incidences by 
the contribution of the actual containers. 

To make this more concrete, consider a container C appearing in A in some size class. If 
this container came from k smaller containers, then those smaller containers are size at most 
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2 • Here the factor 2 comes from the fact that during grouping our container could have 
been rounded down hy a factor of 2. Therefore the contribution of the shadow incidences 
of these smaller containers to the left hand side is ^ . 2i7/i6^-i/i6 

we chose the parameters so that whenever we combine k containers we have k > 2^® and so 
the contribution is at most The shadow incidences £ levels down similarly 

contribute Then the total contribution of the shadows of C to the left 

hand side of property (C) is at most 

^ < 24-5(0^^'^® < 24-5(0. 

e>Q 

4 Applying the Lovett-Meka algorithm 

Using the grouping and container reassignment above, we can replace y with y and x with x 
so that the incidence matrix A and shadow matrix A satisfy properties (A) - (C). We now want 
to create intervals of the rows of A and A in a nice way so that we can apply Lovett-Meka and 
make x more integral. Formally, we will argue the following: 

Claim 14. Suppose x e [0, l[^,y e Z>q, A is the incidence matrix ofx, and A is a matrix so 
that A and A satisfy conditions (A) -t- {B) -f- (C). Then there is a randomized polynomial time 
algorithm to find a vector x satisfying 

• l^x= l^x 

• dejix, y) < defix, y) -t 0(1) 

• |^ac(x)| < ^|/fflc{x)| 

Suppose the containers appearing in the patterns in supp{x) are O,..., O, ordered from 
largest to smallest. As we fix the fractional solution x for now, let us denote n(Q) := Lpefrac(x) Pq 
II Aq 111 as the number of incidences of container C, in A. Similarly, let hii) = || Aq ||i be the 
number of incidences in the shadow matrix A. Again, we have n[i)< hii] for all i. Finally, let 
us denote ha := Y.i in class a h{i) as the total number of shadow incidences that occur for size 
class cr. 

For a fixed constant K>0, and for each small size class a, we first create level 0 intervals 
of the rows as follows. For any row i satisfying hii) > we let {i} be its own interval. 

We then subdivide the remaining rows into intervals so that hU) ^ for each interval 

I. We need a total of at most ^a^^^^^ha +1 intervals on level 0. 

Now, given an interval I on level £ with | /| > 1, we will subdivide I into at most 3 intervals 
on level £ ^\. First, for any row i e I with h{i) > , let {i} be its own interval. 

We then subdivide the remaining rows into intervals so that nil] < i^V Since none 

of the rows i e I became its own interval on level £, we also know that hii] < i\]^ 
and so in fact this bound holds for every interval on level £ +\. The number of intervals on 
level £ is at most 3^ • i^a^^'^^ha + 1). 

For large size classes o, create an interval for each row {i}. Due to the grouping proce¬ 
dure, the size of each interval is at least 64. All such intervals are level zero, and we do not 
create any higher levels. 

Let us abbreviate all intervals on level £ for size class o as Xaj . We denote Xa := \Je>oXaj 
as the whole family for size class a and X := as the union over all size classes. 
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For an interval I, we define the vector 


vr.= Y.Ai 
iel 


as the sum of the corresponding rows in the incidence matrix. 

For an interval I e we define A/ := ( (that means the parameter just denotes the 
level on which it lives). The input for the Lovett-Meka algorithm will consist of the pairs 
where we use A/ > 0 as the parameter for a constraint with normal vector vj. 
Additionally, we add a single vector i^obj := 1 with parameter Aobj := 0 to control the objective 
function. There are two things to show. First we argue that the parameters are chosen so that 
the condition of the Lovett-Meka algorithm is actually satisfied: 

Lemma 15. Suppose that\supp[x) \ > Llog{-^). ForK,L large enough constants, one has 

^min 

1 < J_ . \supp{x)\ 

IeI 16 


Proof. On level 0, we have \Xafi\ ^ -ha + l many intervals and hence on level £ >0 

there are IXq./! < 3^ • -ha + l) many. We can calculate that 




/el 


crsmall/>0 crlarge^aO 


I I- 

a small/>0 


K 


•ha + \)+ Y 

a large 


for/r,Llarge enough 1 1 1 

— |supp(x)|-t Y ®-na + —lsupp(x)| 


64 


128-24 


a small 


property (C) 
< 


1 1 ^ 1 

— |supp(x)|-i-) a-na-\ -|supp{x)| 

64 128 V 64 

3 1 

< —-IsuppCx)! < —-Isupplx)!-!. 

We used that the total size for each pattern is at most 1, and so the sum of the sizes of all 
incidences in the matrix A is at most | supp (x) |. □ 


Now, suppose we do run the Lovett-Meka algorithm and obtain a solution x with |frac(x) | < 
||frac(x)| so that 

I <r’ 7 ,x-x> I < A/-II it/lb V/eX and l^x=l^x. 

The following is crucial to our error analysis: the lengths || i/j || 2 that appear in the error bound 
are not too long and in particular the ratio decreases with smaller container sizes. 

Lemma 16. Fix an interval I e X^.e where a is small. Then || vi \\2 < h{I) ■ 

Proof. Recall that Vj = Y.ui Aq where each row Aq has a row-sum of || Aq || 1 < || Aq II 1 . We 
have h{i) = IIAqIIi > while ||Aq|Ioo^ Therefore, we have 

IIAc,II 2 < ^IIAc,lli-||Ac,lloo < ^IIAc,lli-||Ac,lloo = II Ac, lli^^^ < n(i) • 

Then by the triangle inequality || r'/|| 2^I,e/l|Ac,ll2^fi(f)-0'^'^- □ 
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The next step should he to argue that the error in terms of the deficiency will he small. 
Recall that we still assume that containers are sorted so that 1 > s(Ci) > ${€ 2 ) > ... > siCs) > 0. 


Lemma 17. Let Ci be a container in small size class a. Then 


J^Acj{x-x) 

J^i 


<0 


Iff 


15/16 


IfCi is a large container, then T.j<i Acj [x- x) = 0. 

Proof. If Ci is a container in small size class a, we can write the interval {1,= U/eK/) I as 
the disjoint union of intervals X(i) e X from our collection so that the only intervals / e X{i) 
with A/ > 0 that we are using are from class a and we only take at most three intervals from 
each level; for all three such intervals on level £, we have || Vi \\2 < < K■ 2~^ 

Consequently we can hound 


J^Acjix-x] 

j^i 


X Ar\\vi\\2<Y.^^-K-2-^ 


Y N15/16 

.ff j 



15/16 


If Ci is a large container, we can write {1,/} as a disjoint union of intervals with A = 0, and 
so the statement holds. □ 


It remains to argue why def{x, y) < def(x, y) + 0(1) for one application of Lovett-Meka. 
First notice that AcjX = mult(Cy x) and AcjX = mult(Cy x). Therefore hy Lemmas [9] and 
[T7]the rounding of each size class a increases the deficiency hy at most 0(1) • (^)^^^^® • a = 
0(1) ■ Summing over all size classes gives a total increase in deficiency 

0(1)- Y. ff^^^'^^o(i). 
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Appendix A 

Here we give the proof of Lemma[3l Recall that for our result we would need f[k) = 0(log(k)). 

Proof. Let (s, b) be any bin packing instance with s e [0,1]" and beN". Let U := be 

the total size. Note that U < OPTf <2f/-i-l, soUisa good estimate on the value of the LP 
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optimum. We split items into large ones L := {i e [n] \ Si > and small ones S := {i e [n] | 

Now, we perform the geometric grouping from IKK82I to the large items as follows: sort 
items consecutively and form groups of total size between 2 and 3. Then for each group, 
round all items to the largest item type in its group. This procedure allows to reduce the 
number of different item types to U while the optimal fractional value increases to at most 
OPT'^ < OPTf + O(logL0. Now we run the assumed algorithm to assign items in L to at 
most OPT'j: + fiU) < OPTf + fiOPTf) + OClogLO bins. Here we are using that OPTf > U is 
an upper bound on the number of items in the modified instance and Smin is a lower 
bound on the item sizes in L. 

Then we “sprinkle” the small items greedily over those bins. If no new bin needs to be 
opened, we are done. Otherwise, we know that the solution consists of k bins such that k-l 
bins are at least 1 - full. This implies U > [k-1) ■ [I - jj), and hence k<U + 3< OPTf+ 3 
assuming U> 2. □ 
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